Corpus: eng-bs_web_2013_10K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1 to 5) at sentence endings, counted over the first K sentences

      K   # of words   # of bigrams   # of trigrams   # of 4-grams   # of 5-grams
    100           91             94              95             95             97
   1000          743            929             969            983            989
  10000         4223           8094            9174           9626           9751
 100000         4223           8094            9175           9627           9752
1000000         4223           8094            9175           9627           9752
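
The counts in the table above could be reproduced along the following lines. This is only a minimal sketch, not the tool that generated this page: the function name `count_ending_ngrams` is hypothetical, tokenization by whitespace is an assumption, and it assumes that each column counts distinct N-grams formed by the last N words of a sentence.

```python
from typing import List, Sequence


def count_ending_ngrams(sentences: Sequence[str], k: int, max_n: int = 5) -> List[int]:
    """Count distinct sentence-final word N-grams (N = 1..max_n) in the first k sentences.

    Assumption: tokens are separated by whitespace; the corpus tool may tokenize differently.
    """
    # One set of distinct ending N-grams per value of N.
    seen = [set() for _ in range(max_n)]
    for sentence in sentences[:k]:
        tokens = sentence.split()
        for n in range(1, max_n + 1):
            # Sentences shorter than n words contribute no n-gram.
            if len(tokens) >= n:
                seen[n - 1].add(tuple(tokens[-n:]))
    return [len(s) for s in seen]


if __name__ == "__main__":
    demo = ["the cat sat down", "a dog sat down", "birds fly"]
    print(count_ending_ngrams(demo, k=3, max_n=4))
```

Under this reading, a value of K larger than the number of sentences in the corpus simply counts over all available sentences, which would explain why the counts stop growing for large K.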


Zipf's diagram for sentence endings


[Figure: Gnuplot diagram]
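
The data behind a Zipf diagram of this kind is a rank-frequency list, usually plotted with both axes on a logarithmic scale. The sketch below shows one way such a list might be derived for sentence-final words. The function name `ending_rank_frequency` is hypothetical, whitespace tokenization is an assumption, and the original diagram may be based on a different unit than the single last word.

```python
from collections import Counter
from typing import List, Sequence, Tuple


def ending_rank_frequency(sentences: Sequence[str]) -> List[Tuple[int, int]]:
    """Return (rank, frequency) pairs for sentence-final words, most frequent first.

    Assumption: the last whitespace-separated token is the sentence ending.
    """
    counts = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        if tokens:  # skip empty sentences
            counts[tokens[-1]] += 1
    # Rank 1 is the most frequent ending.
    frequencies = sorted(counts.values(), reverse=True)
    return list(enumerate(frequencies, start=1))


if __name__ == "__main__":
    demo = ["go home", "stay home", "come here"]
    for rank, freq in ending_rank_frequency(demo):
        print(rank, freq)
```

Writing the pairs out one per line, as in the usage example, gives a two-column file that a plotting tool can read directly.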

Computed in 943 msec on 2018-04-12 at 20:13